Average sentence length (words) |
---|
24.1365 |

Sentence length (words) | Percentage of sentences |
---|---|
2 | 0.0033 |
3 | 0.0467 |
4 | 0.0567 |
5 | 0.1933 |
6 | 0.3533 |
7 | 0.6167 |
8 | 0.8367 |
9 | 1.0900 |
10 | 1.5333 |
11 | 1.9933 |
12 | 2.2600 |
13 | 2.5300 |
14 | 2.9067 |
15 | 3.3333 |
16 | 3.5233 |
17 | 3.7333 |
18 | 4.0467 |
19 | 4.1400 |
20 | 4.1867 |
21 | 4.0533 |
22 | 4.2500 |
23 | 4.1133 |
24 | 3.7900 |
25 | 3.9067 |
26 | 3.8733 |
27 | 3.6633 |
28 | 3.5367 |
29 | 3.3667 |
30 | 3.2800 |
31 | 2.8367 |
32 | 2.7800 |
33 | 2.8300 |
34 | 2.4133 |
35 | 2.2033 |
36 | 2.0267 |
37 | 1.8567 |
38 | 1.6933 |
39 | 1.4900 |
40 | 1.1633 |
41 | 1.0900 |
42 | 0.7833 |
43 | 0.5200 |
44 | 0.3833 |
45 | 0.3133 |
46 | 0.1833 |
47 | 0.1000 |
48 | 0.0767 |
49 | 0.0300 |
50 | 0.0100 |
Next we measure the length of a sentence by its number of words. Again, we are interested in the average length and the length distribution.
See above.
For simplicity, the number of words in a sentence is calculated as 1 + (number of blanks in the sentence).
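As a quick sanity check of this rule, the expression can be run on a literal string. The sample sentence below is made up for illustration and is not taken from the corpus. Note that the rule assumes words are separated by exactly one blank; repeated, leading, or trailing blanks would inflate the count.

```sql
-- Illustrative check on a made-up sentence (not from the corpus).
-- 'This is a test' has 14 characters, 11 once the blanks are removed,
-- so the rule gives 1 + 14 - 11 = 4 words.
SELECT 1 + CHAR_LENGTH('This is a test')
         - CHAR_LENGTH(REPLACE('This is a test', ' ', '')) AS word_count;
```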
Average length:
SELECT AVG(1 + CHAR_LENGTH(sentence) - CHAR_LENGTH(REPLACE(sentence, ' ', ''))) FROM sentences;
Distribution:
SELECT @all := COUNT(*) FROM sentences;
SELECT 1 + CHAR_LENGTH(sentence) - CHAR_LENGTH(REPLACE(sentence, ' ', '')), 100 * COUNT(*) / @all FROM sentences WHERE 1 + CHAR_LENGTH(sentence) - CHAR_LENGTH(REPLACE(sentence, ' ', '')) <= 50 GROUP BY 1 + CHAR_LENGTH(sentence) - CHAR_LENGTH(REPLACE(sentence, ' ', ''));
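The distribution query repeats the word-count expression three times. One possible alternative, sketched here under the assumption of a MySQL version that supports derived tables and scalar subqueries, computes the expression once. The aliases `words`, `percentage`, and `counted` are illustrative names, not part of the original setup.

```sql
-- Sketch of an equivalent distribution query: the word count is computed
-- once in a derived table, and the denominator still counts all sentences,
-- including those longer than 50 words.
SELECT words,
       100 * COUNT(*) / (SELECT COUNT(*) FROM sentences) AS percentage
FROM (
    SELECT 1 + CHAR_LENGTH(sentence)
             - CHAR_LENGTH(REPLACE(sentence, ' ', '')) AS words
    FROM sentences
) AS counted
WHERE words <= 50
GROUP BY words
ORDER BY words;
```

The explicit `ORDER BY` makes the row order independent of how the server happens to execute the grouping.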
4.2.1 Length of sentences in characters